Ultra-stemming and Statistical Summarization at INEX 2013 Tweet Contextualization Track

نویسندگان

  • Juan-Manuel Torres-Moreno
  • Patricia Velázquez-Morales
چکیده

According to the organizers, the objective of the 2013 INEX Tweet Contextualization Task is: “...The Tweet Contextualization aims at providing automatically information a summary that explains the tweet. This requires combining multiple types of processing from information retrieval to multi-document summarization including entity linking.” We present the Cortex summarizer applied to the INEX 2013 task. Cortex summarizer uses several sentence selection metrics and an optimal decision module to score sentences from a document source. The results show that Cortex system performs well on INEX task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Testing a Statistical Word Stemmer based on Affixality Measurements in INEX 2012 Tweet Contextualization Track

This paper presents an experiment of statistical word stemming based on a xality measurements. These measurements quantify three characteristics of language. In this experiment we tested one strategy of stemming with three di erent sizes of training data. The developed stemmer was used by the automatic summarization system Cortex to preprocess input texts and produce readable summaries. All sum...

متن کامل

Three Statistical Summarizers at CLEF-INEX 2013 Tweet Contextualization Track

According to the organizers, the objective of the 2014 CLEFINEX Tweet Contextualization Task is: “...The Tweet Contextualization aims at providing automatically information a summary that explains the tweet. This requires combining multiple types of processing from information retrieval to multi-document summarization including entity linking.” We present three statistical summarizer systems ap...

متن کامل

An Automatic Greedy Summarization System at INEX 2013 Tweet Contextualization Track

According to the organizers, the aim of the 2013 INEX Tweet Contextualization Track is: “...given a tweet, the system must provide some context about the subject of the tweet, in order to help the reader to understand it. This context should take the form of a readable (and short) summary, composed of passages from [...] Wikipedia.” We present an automatic greedy summarizer named REG applied to...

متن کامل

A Hybrid Tweet Contextualization System using IR and Summarization

The article presents the experiments carried out as part of the participation in the Tweet Contextualization (TC) track of INEX 2012. We have submitted three runs. The INEX TC task has two main sub tasks, Focused IR and Automatic Summarization. In the Focused IR system, we first preprocess the Wikipedia documents and then index them using Nutch with NE field. Stop words are removed and all NEs ...

متن کامل

Tweet Contextualization using Continuous Space Vectors: Automatic Summarization of Cultural Documents

In this paper we describe our participation in the INEX 2016 Tweet Contextualization track. The tweet contextualization process aims at generating a short summary from Wikipedia documents related to the tweet. In our approach, we analyzed tweets and created a query to retrieve the most relevant Wikipedia article. We combine Information Retrieval and Automatic Text Summarization methods to gener...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013